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Abstract 

In QFT, the null energy condition (NEC) for a classical field configuration is usually 
associated with that configuration's stability against small perturbations, and with the sub- 
luminality of these. Here, we exhibit an effective field theory that allows for stable NEC- 
violating solutions with exactly luminal excitations only. The model is the recently in- 
troduced 'galileon', or more precisely its conformally invariant version. We show that the 
theory's low-energy 5-matrix obeys standard positivity as implied by dispersion relations. 
However we also show that if the relevant NEC-violating solution is inside the effective the- 
ory, then other (generic) solutions allow for superluminal signal propagation. While the 
usual association between sub-luminality and positivity is not obeyed by our example, that 
between NEC and sub-luminality is, albeit in a less direct way than usual. 

1 Introduction, and relation to previous work 

Of all energy conditions of general relativity (GR), the 'null' one stands out as the best defined — 
thus the most likely to follow from fundamental properties of well-behaved physical systems. The 
reason is that the null energy condition (NEC), 

T^u n^^n" > for all null n'^'s , (1) 

is saturated by the cosmological constant contribution's to T^j^^. It is therefore impossible to 
violate the NEC by adding a suitably negative c.c. to an healthy system, or conversely, to obey 
it by adding a large positive c.c. to an otherwise negative-energy system. This is not the case for 
other energy conditions. One could postulate that the latter be obeyed by the "non-cosmological 
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constant part" of T^^,, but the splitting between the cosmo logical constant and everything else is 
at best non-local in field space and thus in real space — in the sense that it cannot be performed 
on the basis of local observations — and more likely ill-defined. 

1.1 A no-go theorem. . . 

Since the value of the c.c. does not affect the behavior of a theory at distances smaller than 
the curvature radius — and at all if we decouple gravity — it is on the NEC that we should focus 
to look for a generic relation with good properties of healthy physical systems. In fact it is 
notoriously difficult to violate the NEC while retaining stability. Indeed, it has been shown in 
ref. |1] that for a generic effective theory of scalar fields (f)^ 0, the NEC follows from stability 
and isotropy, or alternatively from stability and sub-luminality. Explicitly, given the low-energy 
effective Lagrangian as a derivative expansion, 

£ = £(0^90^990^...) , (2) 

at low energies one can restrict to the lowest-derivative order — at most one derivative per field — 
and ignore higher derivative terms, 

C ^ C{<P\B'') , B'' = d^cp'd^<p' . (3) 

Then one can relate the derivatives of the Lagrangian w.r.t. the metric to the derivatives of the 
Lagrangian w.r.t. the scalars' derivatives. That is, there is a very direct relation between a generic 
solution's stress-energy tensor, and the kinetic action for small perturbations of the same solution. 
One can thus show that if a solution violates the NEC, either (i) it is unstable, or (ii) if it is 
stable, both its stress-energy tensor is anistropic, and some of its perturbations are super-luminal. 
Conversely, if a solution is stable and has an isotropic stress-energy tensor, or if it stable and 
admits sub-luminal excitations only, then it obeys the NEC [I]. 

The instabilities we are concerned about are those associated with negative kinetic or gradient 
energy for the fiuctuations, rather than with negative mass terms [§. This is because such insta- 
bilities plague all short-wavelength fiuctuations, down to the UV cutoff of the theory. The decay 
rate is thus extremely fast, dominated by the shortest scales in the theory, and cannot be reliably 
computed within the effective theory. Conversely, a standard tachyon-like instability, like Jeans's 
or that associated with the negative mode of a true-vacuum bubble, is dominated by modes with 
momenta of order of the tachyonic mass scale, which can be parameterically smaller than the UV 
cutoff. 

Now, the theorem summarized above crucially relies on neglecting higher-derivative contribu- 
tions to the Lagrangian. Is this justified? Usually it is, for higher-derivative terms are irrelevant 
at low energies and, perhaps more to the point, if they do become important the derivative expan- 
sion is expected to break down — with an infinite tower of even higher-order terms all becoming 

^Such a system is quite general. For instance it describes the low-energy dynamics of perfect fluids and solids 

^For background solutions that break Lorentz-invariance spontaneously, the definition of stability may be subtler. 
See ref. [1] for a discussion about this point. 
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important at the same time. At the level of the classical e.o.m., higher- derivative terms signal 
the presence of massive ghost-like degrees of freedom, thus demanding that the effective theory 
break down at energies below their masses and effectively making the higher-derivative terms 
unimportant within the regime of validity of the effective theory. 

1.2 . . .with exceptions 

However there can be exceptions to this general logic. The first exception is provided by the 
ghost-condensate [2]. There, the fluctuations' Lagrangian is degenerate (or nearly so) at the 
lowest-derivative level, and as a consequence certain higher-derivative terms can be important 
at low energies, below the cutoff of the theory. This is consistent with the derivative expansion 
because the background solution — the 'condensate' — spontaneously breaks Lorentz invariance. It 
is then higher spatial derivative terms that become important for the fluctuations' dynamics. 
They do not introduce any new ghost-like degrees of freedom at the classical level, and being the 
leading source of gradient energy, they change the scaling dimensions of higher-order operators 
thus keeping the derivative expansion under control f2||. Indeed the ghost-condensate was used in 
ref. [3] to construct a cosmological solution that violates the NEC while being stable and isotropic, 
and having (highly) sub-luminal excitations only. 

Here we consider a second exception — that provided by the galileon [Ij. The model involves a 
scalar vr enjoying a shift-symmetry on itself as well as on its first derivatives, 

71 ^ TV + C + , (4) 

with constant c and 6^. This is reminiscent of the invariance of non-relativistic mechanics under 
galilean transformations x{t) —>■ x{t) + xq + v^t — hence the name of the model. Now, galilean 
invariance (jl]) forces each tt in the e.o.m. to be acted upon by at least two derivatives. Absence of 
ghosts requires instead that there be no more than two derivatives per field. There is therefore a 
preferred set of higher- derivative terms in the Lagrangian — those that yield purely two-derivative 
equations of motion for tt. Such terms have been classified in ref. [1] . In four spacetime dimensions 
there are four: the standard kinetic term 

A = -\{d7^f , (5) 

a cubic interaction 

£3 = -i(97r)2nvr , (6) 

and quartic and quintic ones 

£4 ~ dTTdTl{d^Tlf , £5 ~ dTTdTl{d'''Kf , (7) 

with suitable Lorentz contractions H. 

Now, if we stick to this particular set of interactions, at the classical level no new ghost- 
like degree of freedom shows up in the non-linear regime — by construction. This is a necessary 

•^Formally, the tadpole tt should also be added to this class However in its presence the trivial configuration 
TT = is not a solution. We will therefore set it to zero. 
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condition for the quantum effective theory not to break down for sizable classical non-linearities. 
In fact the model exhibits remarkable properties at the quantum level as well. First, thanks to 
galilean invariance, the Lagrangian terms we discussed do not get renormalized upon inclusion 
of quantum corrections, nor do the latter generate terms with fewer derivatives, like e.g. {dir)^ 
[S]. Second, with a consistent, if not generic choice of counter-terms, our effective theory can be 
extrapolated deep into the non-linear regime of classical solutions. Essentially, galilean invariance 
allows for a robust disentaglement of large classical non-linearities from large quantum effects. See 
ref. [6] for details and discussions about this point. 

In summary, the galileon provides us with a classically as well as quantum-mechanically consis- 
tent higher-derivative Lagrangian, which can potentially evade the no-go theorem reviewed above. 
Analyzing this possibility is the goal of our paper. 

2 The NEC-violating solution 

For practical purposes and for reasons that will become clear below, it is more convenient to work 
with a slight deformation of the theory. We can promote the Poincare group supplemented by 
the galilean transformations (jl]) (a fifteen-dimensional sjnnmetry group overall) to the conformal 
group S0{4:, 2) [4]. Indeed, if we think of the conformal group as the subgroup of diffeomorphisms 
that leaves the fictitious metric 

<?W(x)^e2-(-)r^,. (8) 
conformally flat, then we recognize in eq. (jl]) the small- vr limit of infinitesimal dilations 

7r(x) ^ 7r(Aa;) + logA , (9) 

and of infinitesimal special conformal transformations 

7r(x) 7r(x + {bx^ - 2(6 ■ x)x)) - 2b^x^ . (10) 

Then one can show that for each galilean invariant term, there is a corresponding conformally 
invariant one that reduces to that for small tt. For instance the kinetic and cubic terms dSl [6]) get 
promoted to 

A -le'^idn)' (11) 
A - -'^{dnyUn - lidn^ . (12) 

Such conformally invariant 'completion' is most easily constructed by taking suitable curvature 
invariants for the metric ([8]), as described in ref. [1]. Working with the model's conformal version 
makes finding interesting highly symmetric solutions and studying their properties particularly 
easy. For instance, there is a deSitter-like solution where the fictitious metric (IHl) describes (part 
of) deSitter space: 

e^ds = L -oo < t < . (13) 

Hot' ^ ^ 
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Such a solution spontaneously breaks the conformal group 5*0(4,2) down to one of its maxi- 
mal subgroups, the deSitter group 5*0(4, 1). The 'curvature' scale Hq depends on the model's 
parameters. Starting from a generic linear combination of our Lagrangian terms 

C = C2C2 + ■■■ + C5jC, , (14) 

where with an abuse of notation by we refer to the conformal versions of the galilean invariants 
([5H7]), Hq is a solution to the algebraic equation [1] 

( - 2c2 + 3C3 - 3c4 + f C5 = . (15) 

Notice that, since we work with the dimensionless field vr, the lagrangian coefficients are dimen- 
sionful: c„ oc E*^"^". To make conformal invariance explicit, one could conveniently define a 
dimensionful dilaton field = Me'^, where M is an arbitrary mass scale, and write the lagrangian 
in terms of (f) and dimensionless couplings c„ = c„M^"~^. The action parameterized this way 
would not depend explicitly on M, thanks to scale invariance. In section 3 we shall conveniently 
pick M = Hq. Here we prefer to use the somewhat redundant notation with dimensionful c„ for 
basically two reasons. First we want to make contact with the non-conformal galileon limit. Sec- 
ond, the role of conformal invariance in our dicussion is mostly technical: to simplify computations 
and to deliver a simple globally defined solution around which to expand. The lesson about NEC 
violation is more general. 

Now, let us consider fiuctuations about eq. (fT3l) . They are exactly luminal, because of the 
high degree of symmetry: by construction the full action only involves the fictitious metric ([H]) 
and its curvature invariants. For the background solution we are expanding about, the latter are 
proportional to the former, since the background is maximally symmetric. Therefore, although 
the solution f|T3l) spontaneously breaks Lorentz invariance, the only background tensor structure 
appearing in the fiuctations' action is r]^,^. In particular, it is easy to see that the quadratic 
Lagrangian for the fiuctuation (/? = tt — vTds is 

= d2j^, HH',t\d^r - 1{-4H',)^'] , (16) 

where the overall normalization factor ^2 is |1] 

d2 = C2- 3C3 + |C4 - 3C5 . (17) 

The kinetic term is fixed by the above symmetry considerations, and its normalization by the local 
analysis of ref. [4]. The mass term is also enforced by symmetry. The background vr configuration 
spontaneously break time-translational invariance, t ^ t + e. The fiuctuation (p is the associated 
Goldstone boson, which has to non-linearly represent the broken symmetry: 

(^^(^ + 7r^g(t) -6 = V5- ^ . (18) 

The variation of the kinetic term under an infinitesimal transformation is non-zero: 

H-'.i^(9'pf)-^a,^9>'i = ^-*'p^, (19) 
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where in the last step we integrated by parts. Then the kinetic term has to be accompanied by a 
mass term that compensates for this variation, 



^i--2H^^V)=e-m'^j^^, (20) 

which happens precisely for = —4Hq. This is obviously the same logic that for spontaneously 
broken internal symmetries ensures the masslessness of the associated Goldstone bosons. 
Notice that while the fluctuation's kinetic energy is positive definite for 

o?2 > , (21) 

the mass term always has the wrong sign. However its size is of order of the typical time variation 
rate of the background solution. As a consequence ascertaining the solution's stability is subtler 
than for a time-independent one. In fact the very concept of stability is somewhat ambiguous for 
a time-dependent solution (see a related discussion in ref. [6]). As to short wave-length perturba- 
tions, the solution is certainly stable, for the fluctuations' kinetic energy is positive, and the mass 
term is negligible at high frequencies. For longer wavelengths, we can tentatively define stability 
in the following way. For given initial conditions, the fluctuation's equation of motion yields two 
solutions. At zero momentum, one of these will always be the Goldstone solution: 

V'Goldstone = '^'dsi^) ' ^ ~ i (22) 

with constant e. //this is the leading solution at late times (t — 0^, in our case), we say that the 
background solution vTds is stable. Indeed in such a case any long-wavelength small departures 
from the background solution will be diluted away at sufficiently late times, and we will be left 
with the background solution itself, slightly translated in time. This is hardly an instability. It 
is straightforward to check that for the Lagrangian fll6p the Goldstone solution (1221) is indeed the 
leading one at late times. 

In summary, the deSitter solution is stable against small perturbations if d2 defined in eq. f fT7|) is 
positive. Also perturbations are exactly luminal, thanks to the solution's high degree of symmetry. 



2.1 The stress-tensor 

It remains to be determined whether such a solution can violate the NEC. To answer this question, 
we have to compute the stress-energy tensor. 

For a generic tt configuration the computation is straightforward, but painful due to the higher- 
derivative structure of the Lagrangian. However for our deSitter solution f|T3|) things drastically 
simplify, as we now show. The reader not interested in the details of the computation can jump 
directly to eq. flS^ . which expresses the condition on the coefficients Cj for the solution to violate 
the NEC. 
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All conformal-invariant operators are of the form [1] 
This is just a consequence of scale-invariance, which is unbroken by the solution: 

g27rds(i) _^ g27rds(At)+21ogA ^ g27rds(t) (^25) 

Therefore 

T'^^ = r'^'^^ , (26) 

with a constant and isotropic t'^". Now, conservation of T'^'^ for an homogeneous solution in flat 
space is simply 

which implies = 0. We are left with the pressure T*-' = p6'^^ , which we can get from the trace 

3p = T% , (28) 
which in turn we can compute by varying the action just w.r.t. the conformal mode of the metric: 



^9, 



where 



6ip 



(29) 

1,3=0 



9,Ax)=e'^^'^V,.^9lV{x). (30) 

Now we can use the fact that S.,^ is built out of curvature invariants involving the fictitious 
metric ([H]), where vr itself appears as the conformal mode. More precisely, suppose we compute 
the Ricci tensor for the metric e'^'^g^^, with g^j^^, generic. We get 

R^, [e^^g] = R^, [g] + R^, [e^^r/, g] , (31) 

where by the last term we denote the Ricci tensor for the metric ([8]), "covariantized" with the met- 
ric g^i, — precisely what makes up upon minimal coupling. Actually, to perform the contractions 
appearing in we have to raise one index with the metric e^'^gi^.y. 

R^Ae'^g] = e-'-R^M + e-'^g^^^R^^ie^^v, 9] , (32) 

where by R^i^[G] we are denoting G^'^Rau[G]. As we mentioned, minimal coupling corresponds to 
the replacement 

S^ = j d''xe^^F{R%[e^^T^]) j d^x ^ e^^ F [e-^^ g^'''R^,[e^^7], g]) 

= j d''xy/^ge'^F{R\[e^-g] - e-^^R^M) (33) 



which for the conformally flat metric fl30|) . at hnear order in ip becomes 



dF 



^^[vr + (/^, 0] + j d^x ip{x) (2 9^9, + 5^n) (^^^ 



dF 



(34) 



On a solution of the tt eom, the first term has vanishing variation w.r.t. y?, and we are left just 
with the second: 



OF 



(35) 



The above expression holds for any solution. However in our deSitter case we can simplify this 
expression even further. For the deSitter solution, 



It is straightforward to convince oneself that 



OF 



1 dF{R^^\) 

12 " dm 



(36) 



(37) 



ds "'-'-'0 

Then derivatives in eq. ( l35i) only act on the e^'^ factor, and we finally get 

3 dF{Rds^,) 



Hlt^ dHl 



(3^ 



In summary, to compute the pressure, we just have to express the action of tt in terms of curvature 
invariants of the metric ([8]), plug in the deSitter solution, and then derive w.r.t. to H^. 



2.2 The problem with £3 



If we apply eq. (1551) to £3, we run into a problem. In order to express £3 in terms of curvature 
invariants, on top of Ft' and {R^uY we need the operator [3] 



(R 



^)^ 



R' 



d-1 {d-iy 



(39) 



which diverges when d A for generic metric, but is regular for conformally fiat metrics g^i, 
e^'^rj^y, for which it reduces to 



d'^xe^"-'^^^ 



2{d- 



^□7r(97r)^ + ^(5vr)^ + (nvr)^ 



(40) 



Now, the problem is that to get from eq. fl5^ to eq. fHUl) . we integrated by parts. This amounts to 
neglecting a boundary term, whose coefficient diverges for d ^ 4. This is fine for asymptotically 
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trivial vr's, but for our deSitter solution we have to keep track of the boundary term. Indeed 
eq. fl39|) diverges when computed on a deSitter solution, even though the latter is conformally flat. 
The bottom line is that if we want to apply eq. fl38|) to an action containing £3, we must be more 
careful about boundary terms. 

We can bypass this problem by treating £3 separately, since the computation of its stress- 
energy tensor is straightforward anyway: 

Jd'xCs T^3) = \[2d^'Kd,7rn7r-{d^Trd,{divf + d,7Td^{dny)+7]^,d^7Td''{d7iy] 

+ \[4{d7r)%7rd,iT-7],,{d7ry] . (41) 

[In computing the first line, which comes from eq. ([6]), care has to be taken in varying the □, 
because it contains a covariant derivative. However this is straightforward to do if we rewrite Dtt 

In particular, the trace T^^^^^^ computed on the deSitter solution e^'^'^^ = l/H^t'^ is 



T^^^% = 71- (42) 



3 

Now we can subtract the £3 contribution from the action, and apply eq. flM|) to the remainder. 
The first term in eq. (IMI) . which would vanish on-shell for the full action, now can be rewritten 
as minus the e.o.m. contribution coming from £3. Combining everything we thus get 

where the last term is the same as eq. fl55]) . applied to the non-^3 part of the action. As to second 
term, for the deSitter solution it is 



which exactly cancels the first term. In conclusion, eq. fl38|) holds on-shell if we restrict it to the 
non-£3 part of the action: 

_ 3 rfF(-^)(i?ds^) 
^ Hlt^ dHl ^ ' 

2.3 The recipe 

We are now in a position to check whether there exists a choice for the Lagrangian coefficients 
C2, . . . , C5 that makes our deSitter solution (fT3|) violate the NEC without giving up stability. (As 
we stressed, absence of superluminality is automatic for this particular solution.) The various 
contributions to 

= C2C2 + C4C4 + (46) 
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computed on the deSitter solution are [1] 



/:2 = 

U = 

= 

We thus get 



J2^ 



16 



93 



39 
4-62 



32 -"O 



Irpfj, 

3^ 



1 



[-C2 + '-fc,H^ + lc,H^,] 



(47) 
(48) 

(49) 

(50) 
(51) 



where in the second step we used the e.o.m. ( [T5|) . To violate the NEC, since the energy density 
vanishes on the de Sitter solution, it is enough to have negative pressure, that is 

-lc,-lcsH^o + ^c,H^>0. (52) 

On the other hand, for the de Sitter solution to be stable, we need ( 12T|) . that is 

- 3c2 + 3c3 - |c4 H^>0, (53) 

where we used the e.o.m. ( 1151) . Notice that the e.o.m. (fT5|) is the only one of our conditions where 
C5 appears. We can thus think of it as determining C5 for given Hq and C2, C3, C4. Furthermore, 
we can eliminate by absorbing it into a redefinition of 02,03,04. That is, for the moment we 



can work in H, 



Q — 1 units and reintroduce Hq later by dimensional analysis. Then, combining 

(54) 



eqs. (I52l) and fl53l) we get a condition on C4: 

3|3(2C2 + 3C3) < C4 < 2(C3 - C2 



This interval exists if and only if 

C3 > I ■ C2 • (55) 

So, summarizing, here is the recipe for a system that has a stable NEC-violating, de Sitter-like 
solution: 

1. Pick Hq > for the deSitter-like solution; 

2. Pick C2; 

3. Pickc3>|-ifc2/i/o'; 

4. Pick C4 in the interval 313(202 + 3c3 H^) < C4 ifg < 2(c3 — C2) ; 

5. C5 is fixed by the e.o.m.: C5 Hq = — 1(— 2c2 + 3c3 Hq — Sc^ Hq) ; 

6. The desired action is S*^ = / d'^xc2C-2 + ■ ■ ■ + c^C^ ■ 

Notice that if we want the trivial configuration tt = to be stable, so that we can think of 
our theory as a standard relativistic QFT constructed about a (perturbatively) stable Poincare 
invariant vacuum state, then at point 2 above we need C2 > 0. This is compatible with everything 
else, however. 
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3 The quantum theory 



So far we focused on classical aspects of our solution. However we also have to impose that it 
make sense quantum-mechanically — that is, that it be within the regime of validity of our effective 
theory. At first sight, this is not difficult to achieve. The constraints derived in the previous section 
are linear in CnHQ^^~'^\ and thus can be generically satisfied by choosing 

2 P 

C2 = f , Cn>2 = an 2{n-2) ' (^6) 

with arbitrary and with a„ = 0(1). Now, appears as an overall multiplicative constant 
in the Lagrangian, and therefore its value cannot affect the classical solutions. On the other 
hand, quantum-mechanically it formally plays the role of 1/h. The bigger the smaller the 
quantum mechanical effects. Indeed this is made manifest by going to canonical normalization for 
tt: ttc = / TT. Then all self- interact ions for tTc are suppressed by powers of Hq and of /. By keeping 
Hq fixed and taking / ^ Hq, we can make quantum effects arbitrarily weak while keeping a given 
classical solution unchanged. 

Still, the limit / ^ Hq entails not quite an inconsistency — but a peculiarity of our model. 
The problem lies in the softness of 2 — > 2 scattering amplitudes at low energies, as we now 
explain. It has been pointed in ref. [7], that for an effective theory to be the low-energy limit 
of a relativistic microscopic theory obeying the standard properties of the S'-matrix (analyticity, 
unitarity, Froissart bound), the 2 — 2 forward scattering amplitude for identical particles cannot 
go to zero faster than at low energies — and the coefficient of the term has to be positive. Now, 
the galileon grossly violates this, before the conformal completion we adopted. This is because 
each galilean interaction involves too many derivatives per field (see eqs. (I6|, [7])). And in fact at 
tree-level the 2^2 amplitude vanishes exactly in the forward limit {t 0) — by power counting 
it should behave like s^, but crossing symmetry in the forward limit implies parity under s —>■ —s. 

The conformal completion can in principle fix this, because it supplements the galileon La- 
grangian with interaction terms with fewer derivatives per field ||4j. For instance with £3 there is 
an associated (c^vr)'^ term, which has exactly the right structure to yield an contribution to the 
scattering amplitude. On the other hand, the conformal version of £4 has too many derivatives, 
and that of £5 has to many vr's (as well as too many derivatives), so we can neglect them. So, the 
part of the Lagrangian that can contribute an piece to the forward scattering amplitude is (see 
eqs. 

= -I C2 e'^dTrf - i C3 Dnidny - i C3 (8^)' 

- -\C2 {d^f + \c,{d^f + . . . , (57) 

where we got the last line after suitable non-linear field redefinitions, and the dots stand for higher 
orders in tt and derivatives thereof. 

We thus see that with our conformally invariant Lagrangian, we can have a positive term 
in the low-energy forward 2^2 scattering amplitude, 

M{s,t^Q) r^%s^ + ... , (58) 

^2 
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for positive C3. Notice that C3 > is consistent with our 'recipe' of sect. 12. 3[ The strong couphng 
scale associated with this term in the amphtude is 



A={cl/c,Y^'r.(fHoy/'. (59) 

However, the strong couphng scales associated with higher-order terms can be lower. For instance 
the cubic interaction Dn{dnY used twice in two trilinear vertices connected by a propagator yields 
an + + term (which vanishes in the forward limit but is non-zero otherwise) suppressed 
by the scale 

A'=(c^/c^)'''- (60) 

This is much lower than A for / ^ Hq. More systematically, expanding the tree-level Lagrangian 
in powers of the canonicalUy normalized field tTc, we get interactions of the form 

jCn,m,, ~ dn,dn,{nJfy{dnjHofY"'{ddnjH^jy , (61) 

with 1 < m + g < 3 and arbitrary n, except for m = 0, g = 1, in which case n = 0. The associated 
strong-coupling scales are 

n+2m + g 

An,m,g = Ho{f/Ho)^^^ . (62) 

For f ^ Hq the lowest of such scales corresponds to = m = 0, 

Astrong = Ao,o,g = H^i^f / HqY^^ , (63) 

which is exactly the scale A' of eq. (HI]) . 

Now, the fact that the leading low-energy term in the amplitude is weighed by a scale that is 
much higher than that weighing higher-order terms is not inconsistent, but suggests that the UV- 
completion of the model, if it exists, is weakly coupled. In such a case, the different strong-coupling 
scales A„_m,g do not correspond to any physical scales in the theory — the theory never becomes 
strongly coupled — they are just low-energy mirages arising from taking different combinations of 
a weak coupling and of the true new physics mass-scale of the theory. This is evident for instance 
in the tree-level exchange of a weakly coupled massive particle at low energies: 

(as we will see our case suggests a slightly subtler interpretation than this simple example). Now, 
because of scale invariance ([9]) and of eq. fl56l) . the structure of our Lagrangian is 

C = f e^" dnd-K F {e-^dn/HQ, e-^"" ddn / H^) (65) 

where the function F is dimensionless, and has no small parameter. Then, it is natural to conjec- 
ture that the derivative expansion be controlled by the scale 

(j){x) = e"(^)i/o ■ (66) 
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That is, if we rewrite our classical Lagrangian as 



9' ^ Hl/p , 



(67) 



it is natural to interpret (7 as a coupling constant — weak, in the limit ^ Hq — and as a 
dilaton controlling the mass-scale where the UV-completion enters. If this is correct, then the 
weakly coupled limit g ^ 1 indeed has formally very high strong coupling scale, but this says 
nothing about where the effective theory is actually breaking down. New physics has to enter at 
the scale 0. For the trivial solution tt = we have = Hq, and the effective theory is valid at 
energies below this. For our deSitter solution (IT^ we have 



and the solution itself is outside the effective theory, in the sense that all terms in F above are of 
the same order, thus signaling a breakdown of the derivative expansion. 

We want to stress that, in fact, we are not forced to this conclusion. If F were an infinite 
series — the full derivative expansion — then indeed identifying the UV-completion's mass scale 
would be unambiguous. For instance one could compute the 2^2 tree-level scattering amplitude 
at all orders in about a = const solution. Then the radius of convergence of that series — 
the distance of the first non-analyticities from the origin — would define the scale of new physics. 
Perhaps more physically, the strong-coupling scale associated with a very high-order term would 
be very close to 0, the extra factor of g being diluted by a tiny fractional power (see eq. 
In our case instead F is a finite polynomial, and the identification of the UV-scale is not as 
straightforward. For (7^1 quantum effects are small, and the effective theory is unitary up to 
energies of order of the strong-coupling scale Agtmng = Hq/ g^/^. 

We can however put an upper bound on the UV-completion's scale by means of dispersion 
relations, and such a bound turns out to be parameterically smaller than the strong coupling 
scale — in agreement with the idea that the UV-completion be weakly coupled. If we assume 
analyticity, unitarity, and that the forward 2^2 scattering amplitude A{s) = A^(s,t 0) is 
bounded by at large s, then we have [7] 



where a{s) is the total cross-section for vrvr scattering. With our effective theory we can compute 
the left-hand side, and the low-energy contribution to the right-hand one. Since the integrand is 
positive, this equation then yields an upper bound Auv on the scale where we should stop trusting 
our low-energy computations — the cutoff of the effective theory. About the trivial solution vr = 0, 
(j) = Hq, the l.h.s. can be got from eq. (l58ll : 



1 

'dS — 777 ) 



(68) 




(69) 



X(0) 




(70) 



r2 H^ ■ 
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On the other hand the low-energy cross-section is UV-dominated (as befits a derivatively coupled 
theory), and in particular at energies above Aqq^^/Aq^q = Hq it is dominated by the piece in 
the amplitude, thus scaling as 

<s) ~ g'^, . (71) 

We thus see that in order not to violate eq. fl69|) . the low-energy contribution has to be cut off 
below 

Auv = ^ . (72) 

This defines the highest possible energy scale our effective theory can be extrapolated to (under 
the assumptions given above). The UV-completion has to enter below this scale. Notice that 
Auv is parameterically higher than (p = Hq, which was the cutoff scale 'naturally' suggested by 
the structure of eq. (1671) . This makes our deSitter-like solution consistent with the standard S- 
matrix properties of well behaved relativistic theories. Notice also that Auv is lower than the 
strong-coupling scale, 

Astrong = g^/-^ ' (73) 

as it should be for the consistency of our estimate. 

In general there may be infinitely many more dispersion relations that can constrain further 
the structure of the low-energy effective Lagrangian, and set an upper-bound on its cutoff lower 
than eq. (!72|) . For instance one could consider n ^ n scattering amplitudes with n larger than 
two. Such constraints are going to be quite cumbersome to work out, and are beyond the scope 
of the present paper. On the other hand, in the case of weakly coupled UV-completion further 
information can be extracted from the forward 2 — > 2 amplitude A{s) [7]. We will not take this 
route either — in fact it is straightforward to show that in our case nothing new is learned on the 
cutoff of the theory by taking it. Instead, we will consider signal propagation in the presence of 
weak background fields. This will turn out to be surprisingly powerful. 



4 Retaining causality 

Ref. [7] identified a class of derivatively coupled theories for which ^'-matrix positivity along 
the lines of last section implies the sub-luminality of excitations in the presence of non-trivial 
background fields. In our case, as we will see in a moment, quite the opposite is true: amplitude 
positivity forces C3 to be positive, which in turn makes certain excitations about background fields 
s«]9er-luminal, unless the cutoff of the theory is low enough. We will use this to constrain the cutoff 
of our effective theory. Notice that the analyses of refs. [H H] already exhibited the presence of 
super-luminal excitations for DGP-like and galileon theories about certain background solutions. 
We find it useful however to provide a self-contained similar analysis here, mainly for two reasons. 
The first is that we will exhibit superluminality for essentially any static weak-field solution to the 
field equations — which eliminates the possibility that perhaps superluminality about a particular 
solution just signals that the field cannot be coupled to sources in such a way as to produce that 
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solution. The second reason is that we will show that superluminality is gone if the UV cutoff of 
the theory is of order (0). 

To study the propagation of fluctuations about a background solution, it is more convenient 
to use (p = Hqc^ as field variable. The Lagrangian becomes (see eq. dSZ])) 



9^ 



4 



(74) 



where g = Hq/ f, = c^Hq/P, and the dots stand for higher-order terms, coming from £4 and 
£5. As clear from last section's discussion, the dimensionless coefficient C3 has to be positive, and 
of order unity. Now, consider a background solution 0o(x), and small fluctuations ip about it. At 
quadratic order the fluctuation Lagrangian is 



C = 1 

„2 



1 
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^G^''d^ipd,ip + ... , (75) 



where the dots stand for non- derivative terms (i.e. mass terms) for ip, as well as for contributions 
from the terms already neglected in eq. (17^ . and we defined the effective inverse metric G^'^ as 

...„r(...^-.«-.,(^?!|^-.^) 



The reason why we focus on the kinetic action for ip and neglect non- derivative terms, is that the 
causal structure is defined by the highest-derivative terms: according to Leray's theorem [8], for a 
Lagrangian of the form ( l75l) . ip excitations travel along the light-cone of the effective metric G^^- 
We are thus led to analyze the light-cone of the effective metric ( 1761) in the presence of a 
non-trivial (poi^)- To be as general as possible, we consider a generic weak vacuum solution 0o(a;). 
By 'weak' we mean that the non-linearities of our action are negligible. Also, for simplicity we 
restrict to a static configuration, so that the solution obeys 

VVo - . (77) 

The physical situation we have in mind is that we have stationary, localized sources for 0, and 
that away from the sources the field is in its linear regime. We generically expect there to be a 
stationary point for 0o 0. We zoom in on that point, and choose it as the origin of our coordinate 
system. The solution can be Taylor expanded about the origin 

0o(f)^$o(l + ^A,,^) , (78) 



"^This is guaranteed if the source arrangement is minimally inventive. Also such an assumption is not necessary 
to proceed, but makes the discussion neater. Indeed a space gradient term, corresponding to a term linear in in 
eq. (|78p . gives just a negative definite contribution to Gij in eq. (|80|) . making the light cone even wider. 
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where $o is the field value at the origin, L is the typical scale over which the solution varies, and 
Aij is a dimensionless matrix with entries of order unity and which, because of Laplace's equation 
(I77|) . is traceless. As we will see, this last property is crucial for our argument. 

Now, to impose, as we assumed, that the solution be in a linear regime, we have to demand 

L > l/<l>o . (79) 

Indeed the size of non-linearities in eq. fl7^ for the terms explicitly kept is of order 9^/</)^, which 
for a configuration like flTHjl is precisely l/(L$o)^- The neglected terms in eq. (17^ are suppressed 
by even higher powers of d"^ / cjP' ~ l/(L$o)^ — that is why we are allowed to neglect them in the 
first place. Not surprisingly, the size of non-linearities l/(L$o)^ is also controlling the corrections 
to 7]^^ in the inverse metric fl75]) . As long as this is small, the latter can be straightforwardly 
inverted to yield the effective metric 

G,.„ + 2c, (^-2^^). (80) 

where we neglected the correction proportional to 7]^^, because it does not affect the light-cone 
aperture. That is, at this order we can factor it out of the metric, as an overall conformal factor, 
which is irrelevant for our question. 

To check whether the light-cone of G^y includes directions that are super-luminal with respect 
to the true Minkwoski light-cone defined by r]^^, we contract G^^ with a generic null direction 

= {l,n). If n'^ turns out to be time-like for G^i, at least for some direction n, we know that (p 
signals can travel superluminally along n. For the moment we consider small distances from the 
origin, x <^ L, so that the denominators in (IHOjl can be taken as constant for our solution. We 
have 



The second term in brackets is always a negative definite matrix, being minus the tensor product 
of A ■ X with itself. It thus gives a superluminal contribution in all directions — that is, it tends to 
make n'^ time-like w.r.t. G^^, regardless of n. However at small distances from the origin, x <^ L, 
the second term is negligible with respect to the first. Now, Aij, being traceless, has at least one 
negative eigenvalue. The corresponding eigenvector, defines a direction fi for which the first term 
also gives a superluminal shift in the propagation speed, of order 

(recall that both £3 and A^j are of order one.) We are thus led to the conclusion that a generic 
linear solution ^0(3;) allows for superluminal excitations, because the effective metric perturbations 
propagate in has a light-cone that is broader than that defined by r]^^. We stress that this result 
does not depend on the detailed form of the coupling of to each individual source. For instance 
on whether it couples through a 'monopole' 0J, or via dipole d^(j)J^ or even via higher multipoles. 
This generalizes the proof of superluminality given in ref. [3] by considering spherical solutions 
induced by monopole coupling. 
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The one we just obtained is a purely classical result: at the classical level, to ascertain the 
existence of superluminal signals, it is enough to analyze the effective metric (Leray's theorem). 
Upon canonical quantization the causal structure does not change [9j, but since we have an effective 
theory with a cutoff, to be consistent we have to demand that the superluminal effect we isolated 
be measurable within the effective theory. The superluminal shift f l52]) in the propagation speed 
entails superluminal propagation of actual signals if, for a mode of given momentum k propagating 
from the origin all the way to L, the gain over a strictly luminal signal is at least of order of one 
wavelength. That is, we need 

k 1 

AcA;L~— — — >1. (83) 

We are restricting to a region of size L around the origin, because that is where our approximations 
hold. We will comment below on what happens outside this region. Recall that for the solution to 
be linear, we want L$o larger than one. We thus see that the superluminal effect is unobservable 
if we restrict to fluctuations with momenta smaller than $o- Therefore, if we postulate that the 
cutoff of our effective theory is the local value of 0, like the structure of our Lagrangian (|6711 
naturally suggested, we cannot consistently conclude that the causal structure of the theory is 
different than the usual relativistic one. 

Nothing new is learned by relaxing the x <^ L restriction. For a generic linear solution, with a 
generic arrangement of sources, the structure of the solution will deviate significantly from eq. fl78l) 
beyond x ~ L, and full knowledge of the solution will be needed to conclude anything about signal 
propagation. However since eq. flTSl) is a good vacuum solution for x ^ L as well — non-linear 
effects are down by powers of L^/ ($o^^) — one could study how signals propagate on it for distances 
much longer than L. The problem is that the superluminal effect is proportional to the leading 
non-linearity, and so the former also rapidly goes to zero at very large x. That is, for a solution 
like eq. (|78l) the superluminal effect is concentrated in a region of size L around the origin. 

In summary, if we assume that our effective theory arises as the low-energy limit of a micro- 
scopic theory with the standard relativistic causal structure, we are forced to bound its regime of 
validity by the local value of 0. Under those assumptions our stable NEC-violating solution of 
sect. |2]is therefore untenable. 



5 Back to the amplitude 

In hindsight, it is natural to ask whether the 5- matrix encodes any signs of the superluminal effect 
we unveiled. In simpler examples, like those considered in ref. [7j, sub-luminality of excitations 
about background fields is tied to positivity of forward 2 —>■ 2 amplitudes at order s^. In our case 
instead, as we discussed, such positivity is obeyed so long as C3 is positive, yet we have super- 
luminality regardless of cs's sign. In fact, our super- luminal effect relies on the cubic interaction 
□7r(97r)^ whose contributes to 2 — > 2 amplitudes is of order and vanishes in the forward limit, 
being proportional to + + u^. It thus looks like that in order to isolate the term responsible 
for super-luminality, we need to consider non-forward amplitudes. 

Some form of positivity still holds for non- forward 2 — > 2 amplitudes Ai{s, t). Indeed following 
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Martir@ |10j . unitarity together with standard properties of Legendre polynomials imply that for 
real s 

— Im M{s + ie, 0) > Vn , (84) 

which is a generalization of the optical theorem, ImA^(s + ^e, 0) oc (Xtot > 0. //there is a mass- 
gap, Ai{s + i€,t) is analytic in the t-plane, for \t\ below thresholcl^. This means that we can 
Taylor-expand Ai{s + ie,t) in t about the origin, and use the inequalities to conclude 

lmM{s + ie,t) > for t > , (85) 

so long as t (now real and positive) is below threshold. We can then run a standard dispersion 
argument (see e.g. ff\) — this time using analiticity in the s-plane — and show that 



d"^ 2 f 

—M(0,t) = - dslmM(s + ie,t) 

OS'' TT J 



(s + tf 



> for t > , (86) 



where the integral is taken along the cut on the positive real axis, from the threshold to infinity. 

Now, in our case there is no mass-gap — the particles we are scattering are themselves massless — 
and we have non-analyticities at arbitrarily low s and t. However, we already argued that the 
UV-completion of our model must be weakly coupled, with coupling A -C 1. We can then consider 
the scattering amplitude at tree- level in A. Such an amplitude has non-analiticities associated 
with the exchange of the UV-completion's massive states, but it does not have low-energy cuts 
associated with multi-massless particle states — these enter at higher order in A [7j- It thus makes 
sense to apply the above dispersion relation to the tree-level amplitude, even in the absence of a 
mass gap. The mass-scale playing the role of the gap is then the UV cutoff of the effective theory, 
where the UV-completion enters and the first non-analiticities show up. 

Now, in our case we assumed that the cutoff of the theory is parameterically higher than Ho, 
perhaps as high as eq. (1721) . At intermediate energies, Hq <^ E <^ Auv, the amplitude is dominated 
by the 0{s^) terms, and we can neg lect the C(s2) ones. The former gets a contribution from the 
Lagrangian term we are trying to isolate, □7r(97r)^, but also directly from the quartic galilean 
interaction contained in £4. Indeed we can perform a non-linear field redefinition that eliminates 
the cubic interaction and yields a quartic term of the form 

d-nd-n ddn ddn , (87) 

which is precisely the structure of £4. Therefore, unless the £4 contribution accidentally vanishes, 
which it does not, we already see that our attempt to connect super-luminality with violation of 
amplitude positivity, fails: amplitude positivity can be fixed by a suitable choice of C4, which as we 
argued does not affect signal propagation in the presence of weak background fields. However, if 
we want our Lagrangian to describe NEC- violating stable solutions, C4 is not completely arbitrary. 
It may be that once we impose the conditions of sect. 12.31 positivity turns out to be violated. 



^For a recent application see also [TT] . 

^More precisely, as proven in ref. [101, analyticity in (s,t) holds on the product space {\t\ < M) x (cuts plane). 
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Computing the 0{s^) tree-level amplitude is straightforward. The result is 

3 1 

Mis,t) = -—{cl-2c2Ci)st{s + t). (88) 

4 C2 

Positivity in the form of eq. (15^ is obeyed iff 

cl>2 C2C4 . (89) 

Such an inequality is in fact implied by our conditions of sect. 12. 3[ as manifest from the following 
simple algebraic manipulation: 

< (C3 - 2 C2/H^y = 4 -A{c^C2/Hl - cl/H^) < - 2C2C4 , (90) 

where in the last step we used item 4. of sect. 12. 3[ 

In conclusion, by assuming that our effective theory can be extrapolated to energies parameter- 
ically higher than Hq we find no contradiction with amplitude positivity, even in the non-forward 
case. Conversely, absence of superluminal excitations demands the theory to be cut off at Hq. 

6 Discussion 

Our effective theory allows for an isotropic NEC-violating solution that is stable against small 
fluctuations. Moreover, signal propagation on this solution is exactly luminal. This is a nice 
counterexample to the theorem of ref. which here is violated because one of the hypotheses, 
subtly, is. The relevant part of the Lagrangian formally involves higher- derivative terms, yet the 
corresponding field equations are second-order. 

Still, if the goal was to find a completely sensible relativistic QFT that can violate the NEC 
in the semi-classical regime, then we failed. From the standard EFT standpoint, our theory is 
perfectly sensible, and the relevant NEC solution is within its regime of validity. Moreover, our 
theory obeys standard positivity of 2 — 2 amplitudes, even in the non-forward case — which is a 
stronger requirement than purely low-energy consistency [7]. Nevertheless, if we start from the 
vacuum and excite a generic weak field configuration, small excitations about this background will 
now propagate super-luminally. The presence of super-luminal excitations signals that the causal 
structure of the theory is not the standard Lorentz-invariant one. As a consequence our effective 
theory cannot arise as the low energy limit of a Lorentz-invariant UV-complete local QFT, for 
which micro-causality has to hold as an exact (i.e., non-perturbative) operator statement [Tl[9j. 

Now, we can avoid super-luminality if the cutoff of our theory is in fact much lower than 
expected. However for such a low cutoff, our NEC- violating solution is outside the effective theory 
and thus cannot be trusted. That is: either (i) we believe that the cutoff is high enough to make 
room for the NEC-violating solution, but then we have to accept the existence of super-luminal 
excitations about other solutions, also inside the effective theory; or (ii) we believe that the cutoff 
is low enough to forbid superluminal propagation of signals, but then our NEC- violating solution 
is gone as well. Essentially, absence of superluminal excitations demands that all non-linearities be 
small: if the cutoff of the theory is the local value of (p, as required by sub-luminality, then all non- 
linear terms in eq. fl^TI) have to be small for the solution to be within the effective theory, despite 
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what suggested by purely low-energy EFT considerations and even by dispersion relations. This 
makes the tension we found particularly robust. It is not an accident of our particular solution, 
but rather it is a property of any solutions that crucially rely on non-linearities. Since the NEC 
is obeyed by the free part of the Lagrangian, all NEC-violating classical solutions are outside the 
regime of the effective theory. 

We thus see that there still is a strong tie linking the NEC and subluminality. It is less direct 
than in the absence of higher-derivative terms, where it is the NEC- violating solutions themselves 
that support superluminal excitations pLj , but it appears to be equally tight. It would be interesting 
however to understand whether this relation is an accident of our Lagrangian, or whether instead it 
is a more general, and thus deeper, fact. For instance, we adopted a conformal modification of the 
galileon theory in order to respect positivity at order in the scattering amplitude. However this 
requirement by itself does not single out the conformal group 5*0(4, 2) as the symmetry group of 
our theory. We chose it because then with such a high degree of symmetry, finding very symmetric 
solutions and studying their properties is immediate. However there are probably other equally 
convenient choices — for instance the 5D Poincare group ISO{A, 1) [4J — and in principle nothing 
stops us from lowering the number of symmetry generators, as long as galilean invariance is an 
approximate symmetry in the UV. Note that for superluminality to happen, one just needs a non- 
vanishing galilean cubic term. In our theory cs has to be strictly positive for the NEC-violating 
solution to be well behaved (sect. 12.31) and for the forward amplitude to be positive at order 
(sect. [3]). It is entirely possible that disentangling the coefficient of the cubic galilean term from 
others' could eliminate superluminality altogether while retaining the good features of our theory 
as well as of the NEC-violating solution. 

From a more general field theoretical perspective, we find it interesting that in our theory 
relativistic dispersion relations involving the 2 — > 2 scattering amplitude are not enough to discover 
superluminality in the presence of sources. In the simple cases of ref. [7], the link was so direct 
that forward dispersion relations were enough to imply sub-luminality about any background 
solution. In our case instead, even non-forward ones are obeyed by a theory with superluminal 
excitations. We expect that at some level of complexity (for instance one could consider n — >■ n 
scattering amplitudes, with n > 2) dispersion relations should encompass the requirement of 
subluminality about any background fields. Indeed the sources turning on these backgrounds may 
be viewed as fictitious ones — used to construct the generating functional W[J] from which the 
n-point functions {4>{xi) . . .0(x„)) are then straightforwardly derived. It is clear then, that the 
propagation properties of fiuctuations about some background field are encoded in these n-point 
functions, and thus in the S'-matrix elements. It would be interesting to understand for a generic 
theory the 5- matrix equivalent of the sub-luminality requirement. 
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